Effect of Sampling for Multi-set Cardinality Estimation
Similar resources
Simple set cardinality estimation through random sampling
We present a simple algorithm for estimating the cardinality of a set I, based on a RandomSample(I) primitive that returns an element of I uniformly at random. With probability (1−perr), our algorithm returns an estimate of |I| accurate within a factor (1 ± δerr), invoking RandomSample(I) at most O (
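The abstract above is cut off before it describes the method, so the following is only an illustrative sketch of one well-known way to estimate a set's size from uniform samples, and not necessarily the algorithm of that paper. It counts pairwise collisions among the draws (a birthday-paradox argument): with m uniform draws from a set of size n, the expected number of colliding pairs is m(m−1)/(2n), which can be inverted to estimate n. The function name `estimate_cardinality` and its parameters are hypothetical.

```python
import random
from collections import Counter


def estimate_cardinality(random_sample, num_draws):
    """Estimate |I| from uniform draws by counting colliding pairs.

    random_sample: zero-argument callable standing in for RandomSample(I)
                   (hypothetical interface; must return hashable elements).
    num_draws:     number of times the primitive is invoked.
    Returns None when no collision was observed, since the estimate is
    then undefined.
    """
    counts = Counter(random_sample() for _ in range(num_draws))
    # An element seen c times contributes c * (c - 1) / 2 colliding pairs.
    collisions = sum(c * (c - 1) // 2 for c in counts.values())
    if collisions == 0:
        return None
    # E[collisions] = m * (m - 1) / (2 * n); solve for n.
    return num_draws * (num_draws - 1) / (2 * collisions)


if __name__ == "__main__":
    rng = random.Random(0)
    items = list(range(1000))  # a set of known size, for illustration only
    estimate = estimate_cardinality(lambda: rng.choice(items), 2000)
    print(f"true size = {len(items)}, estimate = {estimate}")
```

The number of draws needed for a given accuracy grows roughly with the square root of the set size, because collisions are rare until the sample is about that large.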
Estimation of Variance of Normal Distribution using Ranked Set Sampling
In some biological, environmental, or ecological studies, obtaining exact measurements of sample units is much harder than ranking them in a small set without reference to their precise values. In these situations, ranked set sampling (RSS), proposed by McIntyre (1952), can be regarded as an alternative to the usual simple random sampling ...
Distributed Set Expression Cardinality Estimation
We consider the problem of estimating set-expression cardinality in a distributed streaming environment where rapid update streams originating at remote sites are continually transmitted to a central processing system. At the core of our algorithmic solutions for answering set-expression cardinality queries are two novel techniques for lowering data communication costs without sacrificing answe...
PSALM: Accurate Sampling for Cardinality Estimation in a Multi-user Environment
In database systems that support fine-grained access controls, each user has access rights that determine which tuples are accessible and which are inaccessible. Queries are answered as if the inaccessible tuples are not present in the database. Thus, users with different access rights may get different answers to a given query. To process queries efficiently in the presence of fine-grained acc...
Cardinality Estimation Done Right: Index-Based Join Sampling
After four decades of research, today’s database systems still suffer from poor query execution plans. Bad plans are usually caused by poor cardinality estimates, which have been called the “Achilles Heel” of modern query optimizers. In this work we propose index-based join sampling, a novel cardinality estimation technique for main-memory databases that relies on sampling and existing index str...
Journal
Journal title: KIPS Transactions on Computer and Communication Systems
Year: 2015
ISSN: 2287-5891
DOI: 10.3745/ktccs.2015.4.1.15